Towards Universal Multilingual Knowledge Bases
نویسندگان
چکیده
Lexical, ontological, as well as encyclopedic knowledge is increasingly being encoded in machine-readable form. This paper deals with knowledge representation in multilingual settings. It begins by proposing a generic graph-based knowledge base framework, and then, in three case studies, explains how preexisting knowledge can be cast into this framework. The first case study involves enriching WordNet with information about human languages and their relationships. The second study shows how machine learning techniques can be used to bootstrap a large-scale multilingual version of WordNet where semantic relationships between terms in many languages are captured. The final study examines how information can be extracted from Wiktionary to produce a lexical network of etymological and derivational relationships between words.
منابع مشابه
ULiS: An Expert System on Linguistics to Support Multilingual Management of Interlingual Knowledge bases
We are interested in bridging the world of natural language and the world of the semantic web in particular to support multilingual access to the web of data, and multilingual management of interlingual knowledge bases. In this paper we introduce the ULiS project, that aims at designing a pivot-based NLP technique called Universal Linguistic System, 100% using the semantic web formalisms, and b...
متن کاملMultilingual Evaluation of KnowNet Evaluación Multiĺıngüe de KnowNet
This paper presents a new fully automatic method for building highly dense and accurate knowledge bases from existing semantic resources. Basically, the method uses a wide-coverage and accurate knowledge-based Word Sense Disambiguation algorithm to assign the most appropriate senses to large sets of topically related words acquired from the web. KnowNet, the resulting knowledge-base which conne...
متن کاملGeneration of Multilingual Personalized Environmental Bulletins from an OWL-based Ontology
Natural Language Generation (NLG) from knowledge bases (KBs) has repeatedly been subject of research. However, most proposals tend to have in common that they start from KBs of limited size that either already contain linguistically-oriented knowledge structures or to whose structures different ways of realization are explicitly assigned. To avoid these limitations, we propose a three layer OWL...
متن کاملExploiting Knowledge Bases for Multilingual and Cross-lingual Semantic Annotation and Search
The amount of entities in large knowledge bases (KBs) has been increasing rapidly, making it possible to propose new ways of intelligent information access. In addition, there is an impending need for systems that can enable multilingual and cross-lingual information access. In this work, we firstly demonstrate X-LiSA, an infrastructure for multilingual and cross-lingual semantic annotation, wh...
متن کاملAutomatic Generation of Multilingual Lexicon by Using Wordnet
A lexicon is the heart of any language processing system. Accurate words with grammatical and semantic attributes are essential or highly desirable for any applicationbe it machine translation, information extraction, various forms of tagging or text mining. However, good quality lexicons are difficult to construct requiring enormous amount of time and manpower. In this paper, we present a meth...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009